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Abstract — We propose a new model predictive control (MPC) 
approach which is completely based on an observer for the state 
system. For this, we show semiglobally practically asymptotic 
stability of the closed-loop for an abstract observer and 
illustrate our results for a numerical example. 

I. Introduction 

Nowadays, a popular control technique for digital control 
problems is model predictive control (MPC). Within such a 
method, a prediction of the state trajectory over finite horizon 
is used for the optimization which is based on estimates 
of the exact initial state. Then, the first control value is 
implemented and the problem is shifted forward in time. 
For these systems we can not expect global stabilization by 
output feedback, see e.g. [10] for a counterexample. Yet, 
using suitable controllability and observability assumptions, 
semiglobal practical asymptotic stabilizability of the system 
can be guaranteed [15]. 

In particular the actual state and its estimate do not have 
to coincide. To avoid the possible divergence of the system 
Shim and Teel split the available computing time in [15]. 
First they calculate a good approximation of the actual state 
using some kind of high gain observer and implement a 
feedback based on this estimate. The generated delays and 
errors, however, are handled by robustness of the controller 
itself and are not integrated in the setup. 
Here our aim is to incorporate these aspects within the 
controller design. To this end we consider the by now well 
know concept of model predictive controllers MPC in a 
sampled-data fashion. 

Within the standard setting of MPC with state estimation one 
utilizes an external approximation of the state and minimizes 
a given cost functional according to the development of a 
simulated system with initial values based on the external 
estimates. This is carried out by appropriately choosing a 
control sequence for this simulation over a finite horizon. 
Then the computed control value is implemented at a future 
time instant and the procedure is repeated, see e.g. [1], [5]- 
[7], [11]. 

Since the internally predicted trajectory is based on estimated 
and not exact state information, it may deviate from the 
real one. Even worse, the corresponding control may even 
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be counterproductive considering the desired behavior. In 
general, this prediction yields a poor estimate of the state 
at the implementation point, especially if we compare it to 
the approximation using the (already given) state estimation 
procedure since such an observer was designed for this task. 
Hence, this additional knowledge of the system is ignored in 
this setting instead of utilizing it to construct a comparatively 
more consistent computation basis. 

However, simply integrating this approximation into the 
MPC concept naturally leads to a time acausality, that is 
future measurements are necessary to generate the estimate, 
see [14] for details. To overcome this deficiency and at 
the same time to analytically handle the delays and errors 
occurring in the output-feedback emulation design of [15] 
we propose a nonstandard MPC closed-loop scheme. Here 
the prediction over the control horizon is calculated not 
by using a model of the plant and some estimated initial 
value, but by making use of a retarded observer. Hence, the 
resulting control law is computed according to a simulation 
of the observer approximating the real (future) state of the 
plant instead of the forward prediction of a model of the 
plant which may diverge. Therefore the control is calculated 
more consistently and an improved behavior of the result- 
ing closed-loop can be expected. Moreover we avoid the 
necessity of splitting the available computing time between 
observer and controller, i.e. guaranteeing observability and 
stabilization, since these parts are merged here within one 
computing step. 

The main goal of this paper is to provide a mathematically 
rigorous stability analysis of the proposed scheme. Section 
HT1 describes the setup and formalize the MPC scheme we 
propose. In Section [HI] we show asymptotic stability for our 
proposed MPC scheme. Finally, we illustrate our setup using 
an example in Section [V] and draw some conclusions in 
Section [V] 

II. Problem formulation 
Consider a nonlinear plant 



x(t) = f{x(t),u(t)), y(t) = h(x(t)) 



(1) 



with vector field / : W x U -> M". Here x(t) G M" denotes 
the state variable, u G U C K m the exogenous control 
variable and y G M. p the output. Now we want to stabilize the 
origin based on the structural information available from the 



plant and the measured outputs using a digital computer with 
sampling and zero order hold at the sampling time instants 

t k = k -T, k G N, T e R >0 . 

In order to achieve that we construct an observer system 



(2) 



with 6 > which is based on past information of the 
output of the plant (Q~|). Using (O we calculate a stabilizing 
sampled-data feedback Ut (£(£)) with zero order hold using 
a MPC algorithm, see e.g. [6]. Hence our control law will 
be based entirely on the observer system and the available 
measurements. 

Fig. 1. Scheme of the considered control system 
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R" x M m -> M> stage cost and F : R n -> M >0 terminal 
cost. 

Now we implement the first element do of the sequence 
W[ jv-i] into the state system (Q3. The resulting output values 
y{r), t G [to, to + T] are measured and stored. Then we can 
continue with the usual MPC procedure, shift the horizon 
of the open-loop optimization in a receding horizon fashion 
and iterate these steps. The resulting control will be denoted 
by 



0) 



(5) 



(i) 

where u represents the first element of the control se- 
quence of the j-th iterate. 

For reasons of simplicity we assume that the infimum in 
© is attained. Additionally we assume f(x, u) to be locally 
Lipschitz in x and h : R™ — > R p to be locally Lipschitz with 
h(0) = 0. 

Remark 2.1: Within the standard MPC setting, the feed- 
back is computed based on an estimate of the state, see e.g. 
[9], [11]. To obtain a minimizing control, however, a model 
of the plant is simulated forward in time. Since there are 



Next we use the constructed feedback to control the plant 
(Q}, and hence the sampled-data closed-loop system is given 
by 

x{t) = /(x(t), UT (efe))), t g \tj,t j+1 ). 

Our goal is to show that we can modify the usual model 
predictive control algorithm to an observer based predictive 
control algorithm (OBPC) which semiglobally stabilizes the 
system (|T). 

To this end we compute the observer trajectory according to 



Fig. 2. Standard scheme for MPC 



£(t) = 9(m, £(* - NT), y(t - NT),vj) 



(3) 



for t G [tj,tj+i), j G {0, ...,N - 1}. The solution of 
the system (01 at time t will be denoted by £o(0> v) 
or given the initial values £(0), 9 G [to - NT, to], and 
control sequence Ufo ...,jv-i] G U . Also we will assume 
that a unique solution of the observer system g (•,-,-,•) : 
1" x R" xW } x M m R" exists for any initial value in 
a given compact neighborhood Af containing the origin in 
the first and second argument. Additionally we assume that 
y{r), t G [to — NT, to], are known output values of the plant. 
Based on system (O we calculate an open-loop optimal 
control of zero order hold 

U[ ,N-l] = arginf Jjv(^(*o), ^[o,jv-i]) ( 4 ) 
u [o,jv-i] 

such that the cost functional 

N-l 

):=E / Km^dt + F(£(t N )) 

3=0 / 

is minimized. Similar to standard MPC the length of the 
horizon [to, to + NT] is fixed by the sampling time T G 
R>o and the parameter N G N. Analogously we call / : 
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no restraints on the plant, the initial error may — in the 
worst case — be amplified by the factor e L ' NT where L 
denote the local Lipschitz constant of the plant and NT the 
simulated horizon. Hence, a control computed on this basis 
is not computed consistently and the resulting closed-loop 
behavior may be far from optimal. 

Comparing Figures Q] and [2] reveals the difference between 
the proposed and the standard setting of MPC: Here, we 
do not simulate the plant itself but use the observer for 
the prediction within the MPC algorithm. The feedback is 
therefore not depending on the state but on an observer of the 
state which is time-retarded. Hence, if the retarded observer 
approximates the state of the plant, the control is computed 
based on a simulation converging towards the actual state of 
the plant instead of a possibly diverging one. For this reason 
we expect an enhanced behavior of the closed-loop. 

Remark 2.2: In practice we cannot measure and store 
the output values for all r G [to, to + T]. For practical 
applications one can overcome this problem as follows: 
i) Introduce a second (output) sampling time T <C T such 
that only y{r\, r = to + if, i = 0,...,M with MT < 
T < [M + 1)T are stored. However, if this sampling time is 
sufficiently small we can use results from [12] to guarantee 
that, given certain consistency and stability conditions, a 



control u which stabilizes 

i(t) = 9(m, Z(t 3 - NT), y(t 3 - NT), u(t)) (6) 

for t 6 [tj,tj+x), tj = to + jT also stabilizes (0. 
Conversely, it was shown in [8] that if the system (0 is 
stabilized by u(t) for a continuous y and if the sampling time 
is sufficiently small then also the in y "emulated system" (0 
is stabilized. 

ii) Alternatively one can use an interpolation procedure to 
approximate the past continuous output y based on measure- 
ments at the sampling instants tj. However, if the sampling 
time is large then one still needs to introduce a finer output 
grid to obtain reliable information. 

In order to define stability of the closed-loop system, i.e. the 
plant in Figure Q] given by 

x(t) = f(x(t), ut(£(*j)))> V(t) = &(*(*)) (7) 
m = <?(£(*), S(t - NT),y(t - NT), ur(e(*i))) 

for t G [tj,tj + i) where ut is given by (0, we introduce the 
notion of comparison functions: A function 7 : K>o — > R>o 
is of class K. if it is continuous, zero at zero and strictly 
increasing. It is of class Koo if it is also unbounded. A 
function belongs to class C if it is strictly positive and it 
is decreasing to zero as its argument tends to infinity. Last, 
a function f3 : R>o X R>o — ► K>o is called /(X-function if 
for every fixed t > the function f3{-,t) is an element of 
class K, and for each fixed s > the function (3(s, •) is of 
class C. 

Here we seek stability of the closed-loop in the following 
sense: 

Definition 2.3 (Semi-global practical Stability): 
Consider a control system 

x(t) = F(x(t), x(t - NT), u T ) 

where ^(0,0,0) = and a family of control functions ut 
for T e (0,T*]. If there exists a function /3 £ ICC and a 
pair (Ai, A2) of positiv real-valued numbers such that the 
inequality 

\\x(t,x ,u T )\\ < max{/3(||x ||,t), A 2 } 

holds for all t > to and all initial values xq{6) G (0), 
9 G [to — NT, to], then the origin is called semi-globally 
practically asymptotically stabilizable. 
This definition includes the standard definition of semi- 
globally practically asymptotically stability if we consider 
F to be independent of past information. Additionally we 
can use it in the context of the retarded observer system (f3) 
and the combined system ©. 

III. Stability of the closed-loop 

We now show the stability of our proposed scheme. Note 
that if we applied an observer which is not retarded we would 
require the measured output y at times t G [to, to+NT], see 
[4] for methodology and [2], [3], [13] for necessary/sufficient 
existence conditions of observer based output-feedbacks. 
Since this represents a time interval in the future we lack 



these values. In order to overcome this acausality we do not 
follow the way described in [14] and use the special structure 
of the ENOCF observer but retard our observer, see [16] for 
an observer construction in the linear case. To show stability 
of our scheme we consider an abstract observer, i.e. with no 
special structure. Here we will only assume that the observer 
possesses the following properties: 

(Al) If 6,(0) = x(9), 6 G [-NT,0], T > 0, N e N 
holds for the history of x and £, then £(t, £o(")j u t) = 
x(t, Xq, u T ) holds for all £ > and all u T G U N . 
(A2) If £(0) ^ i(0) there exists a function f3 G ICC 
such that 

\\x(t, x , u T ) - &(•)» «r)ll < HMO) - £(0)||, t) 
holds for the estimation error for all ut G U n . 
Based on this observer we generate a feedback which will 
give us the stability of the interconnected system. 

Theorem 3.1 (Observer-based Feedback): 
Given constants v > (1 + a)Ai, a > 0, we consider the 
combined system (0 

x(t) = f(x(t), ur(t)), y(t) = h(x(t)) (8) 

and its associated observer system 

i(t) = g(Z(t),ttt - NT),y(t - NT),u T (t)) (9) 

(0 satisfying the previous assumptions. Additionally initial 
values £(8) and x{6) are given which satisfy ||x(0)— < 
v for all 9 G [to — NT, to] ■ Moreover we assume that (0 is 
(Ai, A2)-semiglobally practically asymptotically stabilized 
by the feedback control law 

u T (t) := »(£(t), £(t - NT), y(t - NT)). (10) 

for all possible outputs y G M p . Then the combined system 
(0, (0 is (Ai, A2)-semiglobally practically asymptotically 
stabilized by the feedback law ( fTOb with Ai := v — Ai and 
A 2 := 4A 2 . 

Proof: Since (0 is (Ai, A2)-semiglobally practically 
asymptotically stable for the chosen control there exists an 
attraction rate /3 G ICC and a pair (Ai, A2) of positive real- 
valued numbers such that 

\Ut)\\< max {^(||^o)ll> t),A 2 } 

for all t > t and £(6>) G 5^(0), 9 G [to - NT, to]. Using 
the triangle inequality and (A2) we obtain 

\\x(t)\\<u(t)\\ + ut)-m\\ ai) 

<max{/3(||£(t )||,t),A 2 } + /3(||.T(to)-e(<o)||,t). 
If we consider a mixed 1-norm of this vector 

ll£(*)ll 

as a function of time we get 

p(t) <max{^(||e(*o)||,t),A 2 } + i 8(||a:(to)-e(to)||,*) 
+ max{/3(||e(<o)||,t),A 2 } 
< 2m a x{p{p(t ),t),A 2 } + p(\\x(t )\\ + U(t Q )\\,t) 
= 2 max {0 (p(t ), t) , A 2 } + p{p{t ), t). (12) 



Pit) 



We can conclude by ||x(io) — £(io)|| < v, v > (1 + a)Ai, 
a > 0, and £(0) G 5 Al (0), 9 G [t - NT, to], that these 
conditions can only be satisfied for all x(6) G B^_a 1 (0), 
6* G [to — NT, to] . Since a > this is a nonempty closed 
set and we can restrict ourselves to the set B®_ A (0), in 
particular choose 

Ai :=v - Ai (13) 

and conclude x(t ) G (0) C B°(0). 
If p(to) — we are done since x(i ) = £(io) = and due 
to the /C£-property of /3 the error will be zero for all t > t . 
Hence we can identify the trajectories of £(t) = x(t) for all 
t > t , set 



A, := A 2 



(14) 



and get 



\x(t)\\ <max{/?(||x(f )||,t),A 2 } 



for all t > to and all x(t ) G B^ (0). 
If p(to) / Owe want to obtain 

p(t) <max{^(p(t ),t),A 2 } 

To this end we consider ( TT~2T > for any fixed t > to- Then one 
of the following cases holds: 

(1) /3 (p{to),t) > A 2 : Using this in ([T2| it follows that 

p(t) < 2(3(p(to) : t)+p(p(to),t) 
and we can directly define 

f 1] {r,t) :=2/3(r,t)+p(r,t). 

(2) /3(p(t ),t) < A 2 : From (EJ we can see 

p{t) <2A 2 +p{p{to),t) 

Now we consider the following cases: 

(2a) (3(p(t ),t) < 2A 2 : Here we get p(t) < 4A 2 and 
we can use 

A ( 2 2) := 4A 2 . 

(2b) f}(p(t ),t) > 2A 2 : In this case we get p(t) < 
2/3(p(t ),t) and define 

f 2 \r,t) :=2p(r,t). 
Using these results we see that 

p(t) <max{^ (1) (p(t ),t) ,f 2) (p(to),t) , Af) . 

Now we can simplify this inequality since the maximum of 
two /CX-functions is still a /C£-function. Hence the stated 
inequality holds for all t > to if we use 



Also there exist T, f >t such that 

/Wo),t)|-5 2 ' t0 ~ t<T 
' I < ^ t >T 



and 



HU(to)\\,t) 



> A 2 
< A 2 



t <t<T 
t > T 



Hence the trajectory of the x-system will stay within (0) 

for t > max{T,f}. ' ■ 

Remark 3.2: The assumptions regarding the observer sys- 
tem (0 in Theorem 13 4 1 are stronger then ISS. However, we 
suppose that a generalization to ISS is possible. 

Remark 3.3: The stated bounds can be tightened at a 
certain cost. If one considers (ITTb and the estimate ||£(£o) II = 
||k(*o)|| + v we get ||x(t)|| < /8(2||x(*o)|| + v, t) + 
max {/3(2||x(io) || + v i t), A 2 }. Now we have to consider the 
cases 

(i)Hto)||>e (ii) ||x(i )|| <e 

where < e < v. In case (i) there exists a A G K: v — \e < 
A||x(io)|| and it follows that 

||x(*)|| <max{^(||x(t )||,£),A 2 } 

where 0(r, t) := /3((2 + X)fir, t)+/3((2 + A)r, t) and A 2 := 
A 2 +£. Here p > 1 needs to be chosen as follows: Set e > 
arbitrary. Hence there exists a T > to such that 

P{{2 + A)||x(t )||,i) < H(2 + A)Si,t) *° " * < T 

I S t> 1 . 

and moreover a /i G R such that P(e,t) > A 2 + /9((2 + 
A)Ai,t) for all t < t < T. Hence there exists a f > T 
such that 



to <t <T 
t > f 



:=max{/3 (1) (r,i),/3 (2) (r,t)} 



A 2 := 4A 2 



for all ||x(to)|| > £ where /3 is a /C£-function and the stated 
stability conditions hold since we have only enlarged the 
right hand side. 

The drawback here is that in case (ii) we definitely need 
A 2 > /3 (e, 0) to guarantee semiglobally practically asymp- 
totic stability. To see this consider ||x(io)|| = 0. Then 
the invariance of the set {0} is violated and hence no 
/(X-function can be found. Hence semiglobally practically 
asymptotic stability can only be guaranteed if the maximum 
of the right hand side is always given by A 2 . 
Making use of the ^-property of the function j3 we can 
conclude that A 2 := f3(e,0) + A 2 and hence the trajectory 
of the x-system will stay within B^- (0). 

Remark 3.4: Note that by increasing a the basin of attrac- 
tion B^- (0) can be enlarged as long as the other conditions 
are still valid. 

Remark 3.5: One has to keep in mind that there exists a 
fixed delay between an occurring disturbance in the x-system 
at time t\ > to — NT and the reaction of the observer £ at 
time i 2 = t\ + NT. Since the length of this delay is exactly 
equal to the length of the OBPC horizon this disturbance 
will still be taken into account. 



IV. Example 

In order to check our proposed scheme we consider two 
simple examples 

x(t) = Aix(t) + Bu(t) (15) 
y(t) = Cx(t) 

where 




For these examples we constructed the Luenberger observers 

i(t) = M(t) + Bu(t) - A(A i )Jf i (C^(t) - y(t)) 

with 

A <« = (° 5) «■"*-(&)■ 

Using the parameter Ai = 1.2, k^i = 1 and fc^a = 0.5 the 
eigenvalues of (A — A(Xi)KiC) are negative and hence the 
observer converges. 

As a retarded observer we used the same construction with 
a time shift of the prediction horizon length r = NT in the 
measurement component, that is 

C(t) = M(t) + Bu(t) - A(Xi)Ki(C€{t - r) - y(t - r)). 
To show convergence we consider the Lyapunov candidate 

v( v (t)) = lv(t) T A- 1 wm- 1 Wv(t) 

for the error rj(t) = £(t) - x(t) with P, = > such that 

P(Ai - A(X)KiC) + (A, - A(A)K. i C7) T P 4 = -Id, 

holds. The error dynamic is given by 

i)(t) - ^(t) - A{X)K i Ci 1 {t - r). 

which gives us 

V(»j(t)) =r 1 (t) T W(X)(A t r 1 (t) - A(X)KiCrj(t - NT)). 

where Wj(A) := A _1 (A)PiA _1 (A). Using the integral equa- 
tion for r)(t — NT) we obtain 

t-NT 

T](t - NT) = r)(t) + J r)(r)dT 

= (Id + e- A >- NT Ht) + e A ( A )^ c - JVT ?7 (t - NT) 

The parameter A, k\ and were chosen such that (Id — 
e A(\)KiC-NT^ j s i nver tible, hence we obtain 

V( V (t)) = r 1 (t) T W(X)(A l - A(X)K t C 

(Id - e A(A)^CiVT ) -l (Id + e -ArNT^ t y 

Since W(X) is positive definite it remains to show that 
Ai :=(Ai - A(A)^C(Id - ^W^arryx 
(ld + e~ A - NT )) 



has negative eigenvalues which can be verified for both 
examples given the previously mentioned parameter. 
Moreover the value function is bounded from above 
since using the properties of A we have V(r)(t)) < 
^frj^ || f](t) || 2 =: o;2(||?7(i) ||) and similarly from below due 
to F(r?(t)) > %^hWII 2 =: axdhWID-Rgureishows 
the resulting trajectories for the standard MPC scheme using 
the initial values xq = (11,8) and £o = (0, 0). The sampling 
time is set to T — 0.1 and the horizon length parameter is 
taken to be N — 5 Here no optimization takes place within 



Fig. 3. Resulting trajectories for Example 1 using standard MPC 
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the first step of the MPC algorithm since the initial guess 
of the observer is set to zero. Note that the resulting control 
may even exhibit a wrong sign according to the observed 
state. The observer based predictive controller on the other 
hand recognizes this deviation, cf. Figure |4] Here we chose 
the initialization of the history of the system under control 
to be constant at (11,8). Moreover the transient behavior of 



Fig. 4. Resulting trajectories for Example 1 using OBPC 
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the observer steers the system state even stronger to zero 
since it points in negative direction first. Here one has to 
keep in mind that observer and system state changed places, 



hence the system state converges towards the observer state. 
Considering a variety of initial values the observer based 
variant seems to enhance the stabilization process for this 
example. 

Within our second example we do neither change any of the 
parameter nor the initial values. At first glance of Figures |5] 
and [6] one could suspect that the presented setup does not 
work properly for this example. Analyzing the background 
we see that the resulting swinging trajectory is due to the 
poor convergence rate of the retarded observer coming from 
our construction and is not due to the governing control 
setup. Particularly in contrast to our first example, the 
eigenvalues of the Lyapunov function matrix (W(X) ■ A) 
of the error development cannot be tuned arbitrarily. The 
Luenberger observer considered in the standard MPC setup, 
however, does not exhibit this difficulty and is advantageous 
for this reason. 



5. Resulting trajectories for Example 2 using standard MPC 




Fig. 6. Resulting trajectories for Example 2 using OBPC 




this error due to the retarded nature of the observer, however, 
may exceed any linear bound if the Lipschitz constant of the 
system is greater then one. 

The success of the presented setup will hence depend mas- 
sively on the availability of retarded observers which show 
a tunable rate of convergence. 

V. Conclusion 

In this work we have shown the stability analysis of the 
newly proposed MPC scheme. Moreover we have shown 
that this approach may lead to a considerable improvement 
compared to the standard setup. Future research work con- 
cerns the detailed comparison analysis. Yet, we expect a 
better behavior of the solutions due to the more consistent 
computation of the control values. In particular the recovery 
time of the closed-loop system regarding disturbances will 
be a main issue. Another important aspect is development 
and analysis of retarded observers, at best with tunable rate 
of convergence. 
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